home *** CD-ROM | disk | FTP | other *** search
-
- WWW Search Engines
- //////////////////
-
- Web ROBOTS (info searchers and finders)
-
- CheckWeb
- http://www.stuff.com/~bcutter/home/programs/checkweb.html
- A useful robot that checks your web docs for dead links.
-
- JumpStation
- http://www.stir.ac.uk/jsbin/js
- Robot search engine for locating sites and docs on Web
-
- Lycos
- http://lycos.cs.cmu.edu/
- Robot catalog of Web, gopher and ftp sites
-
- MOMspider WWW94 paper
- http://www.ics.uci.edu/WebSoft/MOMspider/WWW94/paper.html
- Roy Fielding's description of MOMspider and how other bots work
-
- SG-Scout home page
- http://www-swiss.ai.mit.edu:80/~ptbb/SG-Scout/SG-Scout.html
- Another Web catalog robot - runs every few months to update info
-
- WWW Robots, Wanderers and Spiders
- http://web.nexor.co.uk/mak/doc/robots/robots.html
- Place for info on Web robots. Includes list of known bots on WWW
-
- World Wide Web Wanderer Index
- http://www.netgen.com/cgi/wandex
- A searchable index of over 28,000 documents from over 14,000 sites
-
- WWW growth-bot
- http://www.netgen.com/info/growth.html
- - robot that measures the size of the Web
- ---------------------------------------------------------------------------
-
- Web searchers
- /////////////
-
- The WebCrawler
- http://www.biotech.washington.edu/WebCrawler/WebQuery.html
- ----------
-
- The WebCrawler II
- http://webcrawler.cs.washington.edu
- ----------
-
- The WWW Worm
- http://www.cs.colorado.edu/home/mcbryan/WWWW.html
- ----------
-
- GNN Home page
- http://gnn.com/gnn/gnn.html
- Has a huge list of home pages from around the web
- Look in the Netizens area which lists individual home pages.
- ----------
-
- Yahoo server
- Home page URL:
- http://akebono.stanford.edu/yahoo/
-
- Although originally set up as a starting point for web surfers, this page
- has come into its own and now offers a huge database of over 40,000 web
- pages that are part of a searchable index. The database searchable list
- of web pages keeps growing and there is no charge for the use of this
- searching utility. The index is searchable by web page title and keyword
- so there is a good chance that what you are looking for is here. Once found
- the user can automatically link to that page and away you go!
- ---------------------------------------------------------------------------
-
- Usenet Searchers
- ////////////////
-
- Stanford University Electronic Library
- http://sift.stanford.edu
-
- Lets you monitor Usenet newsgroups by entered keywords
- ---------------------------------------------------------------------------
-
- Dictionary Look-Up
- //////////////////
-
- WWW
- http://c.gp.cs.cmu.edu:5013/prog/webster
- http://www.ai.mit.edu/people/wessler/dict
-
- Gopher
- gopher://gopher.niaid.nih.gov:70/77/deskref/.Dictionary/enquire
- gopher://knot.queensu.ca:17502/1webster
- ---------------------------------------------------------------------------
-
- The Open Text Web Index
- http://www.opentext.com
-
- [See the following URL for more info on web search tools]
- http://cuiwww.unige.ch/meta-index.html
-
- The Open Text Web Index, while still under heavy construction, is
- now available for general use. This Web search engine has currently
- indexed about half a million pages (http, gopher, ftp), and intends,
- in the near future, to index over 2 million pages and over 1
- billion words of text. The following features of the Web Index
- distinguish it from the many other other web searchers.
-
- 1. 100% of the full text of every page is indexed
- 2. Boolean and ranked search are supported
- 3. Updated every night in an effort to track the web as closely
- as possible
- 4. Prefix, word, and phrase search all run at the same (high) speed
- 5. Hosted by a big-league Internet Service Provicer (UUNET Canada),
- so there's lots of bandwidth
- 6. It's **FREE***
- 7. Real-time Key-Word-In-Context display of match points, to avoid
- following URL's to check up on every match.
- 8. No preset number of match returns; if you find 845 pages, and have
- the patience, the Index will scroll through them all.
- ---------------------------------------------------------------------------
-